Machine Learning Approaches to CAPTCHA Recognition Requiring Minimal Image Processing

ثبت نشده
چکیده

This study focuses on a machine learning approach to the CAPTCHA recognition problem that requires minimal preprocessing of inputs. Both the feed-forward neural nets and the self-organizing maps used in this study took the raw image pixels as input with only simple segmentation and translation performed in advance. The models were trained and tested on four and five letter CAPTCHAs using either letters A-G or AZ, with both models achieving greater than 90% accuracy on four letter CAPTCHAs of letters A-G. Accuracy decreased with the increase in CAPTCHA length, a result of increased crowding of the letters that reduced the performance of the segmentation algorithms. These results show that the difficulty in CAPTCHA recognition is not in the OCR, but rather in the image segmentation. Future work will focus on the image segmentation problem, in addition to ensemble techniques to increase the accuracy of the current models.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

SEIMCHA: a new semantic image CAPTCHA using geometric transformations

As protection of web applications are getting more and more important every day, CAPTCHAs are facing booming attention both by users and designers. Nowadays, it is well accepted that using visual concepts enhance security and usability of CAPTCHAs. There exist few major different ideas for designing image CAPTCHAs. Some methods apply a set of modifications such as rotations to the original imag...

متن کامل

OTRCaptcha: A Novel Object and Text Recognition Based Image CAPTCHA

CAPTCHA is an important technology to prevent auto-script attack. Currently most of the CAPTCHA systems are text based, which firstly distort, rotate different characters and then use some obfuscation, aiming to make the text difficult to be recognized by autoscript while still can be learnt easily by real users. However, such kind of CAPTCHA schema either too simple, which can be attacked easi...

متن کامل

Introducing New Trends for Persian CAPTCHA

To distinguish between human user and computer program to enhance security, a popular test called CAPTCHA is used on Web. CAPTCHA has an important role in preventing Denial Of Service (DOS) attacks in computer networks. There are many different types of CAPTCHA in different languages. Due to the expansion of Persian-language and documents on internet, creating a suitable Persian CAPTCHA seems t...

متن کامل

Image Classification via Sparse Representation and Subspace Alignment

Image representation is a crucial problem in image processing where there exist many low-level representations of image, i.e., SIFT, HOG and so on. But there is a missing link across low-level and high-level semantic representations. In fact, traditional machine learning approaches, e.g., non-negative matrix factorization, sparse representation and principle component analysis are employed to d...

متن کامل

Breaking CAPTCHAs with Convolutional Neural Networks

This paper studies reverse Turing tests to distinguish humans and computers, called CAPTCHA. Contrary to classical Turing tests, in this case the judge is not a human but a computer. The main purpose of such tests is securing user logins against the dictionary or brute force password guessing, avoiding automated usage of various services, preventing bots from spamming on forums and many others....

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2008